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Abstract 

It is proposed a new code for contours of black-white plane images. 
This code was applied for optical character recognition of printed and 
handwritten characters. One can apply it to recognition of any visual 
images. 
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1 Introduction 

A general approach to character recognition via critical points was presented in 
joint papers of the author with G. Nepomnyashchii p],[2] in 1991. These papers 
present a theoretical background of the critical point approach and inform on 
the first results obtained in its implementation. But these papers do not present 
the concrete code which was realized by programming. The reason is that this 
code was a commercial secret in that time. But now the author has finished 
his software business and has no more obstacles to open the code to the public. 
Let us call this code CRIPT-code, this was the name of OCR-programs which 
was produced under the direction of the author in the years 1990-1995 in some 
small russian software firm named " Scriptum" . 

"This work is supported by the Russian Government project 11.G34.31.0053. and Russian 
Foundation of Basic Researches grant 11-01-00822 
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2 Topological background 



The following exposition one can consider as a 1-dimensional version of the 
Morse theory [3]. This theory is very elementary. That is why we omit some 
simple proofs. For practical work one does need proofs, that is the reason to 
omit a long proofs. 

We always suppose that on plane is fixed a system of Cartesian coordinates. 
The second coordinate will be called height. The set of all points of the plane 
which has a fixed height ft will be called a h-level line or h-level. Each /i-level 
line divides plane into two semiplanes which we will call h-upper and h-lower. If 
H > ft the intersection of iJ-lower and ft-upper semiplanes is called horizontal 
[ft, #]-band. 

2.1 Height classification of arcs 

The reader can understand the word arc as a broken line without self-intersection, 
although almost all consideration are valid for an arbitrary Jordan curve. Closed 
arc is an arc without ends and nonclosed one has two ends. The projection of an 
arc on the second coordinate is said to be height range of the arc. The heights 
of the ends of an arc will be called ends levels. The arcs which are of interest are 
connectivity components of intersection of the boundary of some plane domain 
with a horizontal band. The ends of such arcs belongs to the boundary of the 
band. This motivates the following definition. 

An arc will be called [ft, H]-bounded if it is contained in the [ft, -ff]-band and 
both its ends lyes on the boundary of the band (i.e. on the levels h and H). 

We will divide [ft, ii]-bounded arcs into four height's types corresponding to 
letters B, C, D, O. 

The type C is associated with the word " Crossing" . An arc a has the type 
C or that it is a crossing if the ends of its height's range coincides with its ends 
levels. 

And we will call an arc a as [ft, H]- crossing if it is a crossing with the height 
range [ft, H]. 

The type B is associated with the words " Birth" . An arc a has the type B 
if it is contained in ft-upper semiplane and both its ends have the same height. 

The type D is associated with the words "Death". An arc a has the type 
D if it is contained in the ft-lower semiplane and both its ends have the same 
height. 

And finally an arc is called to have the type O if it is an closed arc because 
closed arcs are O-like. 

2.2 Critical levels 

Let us say that an arc a touches a ft- level if ft is the end of the height range 
of a and is not its ends level. In the case a is contained in the ft-upper (upper 
touching) or ft-lower (lower touching) semiplane. 
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A number h is called a critical height of an arc a and /i- level is called a- 
critical if there is such subarc a' C a, that touches /i-level. 
An arc a without critical heights is are called uncritical. 
The proof of the following lemmas presents a simple exercise on definitions. 

Lemma 2.1 Every uncritical arc is a crossing. 

Lemma 2.2 Suppose that an [h, H]-bounded arc a has at most one critical level 
g. In the case the intersection of a with the g-level is connected. 

2.3 Crossing decomposition 

Throughout the section we consider a bounded domain denoted by S with 
boundary denoted by dS. We do not suppose S to be connected. It may 
contain a finite number of connectivity components. We will suppose that the 
boundary is piecewise linear. 

A horizontal [ft,,_ff]-band is called regular with respect to the domain S 
(shortly S-regular) if h and H are not SS'-critical. In the sequel we will consider 
only S'-regular bands. The intersection of OS with the horizontal [h, i?]-band is 
denoted dS[h, H]. The components of the intersection dS[h, H). are [h, H]- arcs. 

Let us say that a subarc a' of an arc a is its crossing component if a' is 
crossing and any subarc of a containing a' is not crossing. 

Lemma 2.3 If an arc a has type B, D or O it contains two crossing compo- 
nents, these components have the same range as a and its union coincides with 
a. 

For every crossing component a of dS[h,H] let us introduce two numbers. 
The first one we denote by L(a) and name lower position, this number is equal 
to if a has not an end on the level h and is equal to the number of crossing 
component of dS[h, H] which have an /i-level end which lies on the left of the 
ft,- level end of the a. The second number is denoted by U(a) named as upper 
position, it is defined as if a has not an iJ-level end and is defined as the 
number of crossing component of dS[h, H] which have an iJ-level end on the 
left of the if- level end of a. 

Lemma 2.4 Let hi < h% < hs be three S -uncritical levels. Let a be a cross- 
ing component of dS\hi,h2\ with U(a) > and /3 be a crossing component of 
dS[h2,hz\. The intersection a n j3 is nonempty iff U(a) — L(f3). In this case 
a U /? is a crossing component of dS[h\, hs] 

2.4 CRIPT-code of the domain 

Let us say that a [h, H ]-band is S-unicritical if it is S'-uncritical or if the segment 
[h, H] contains only one 5-critical height in its interior. Let us remark that for 
this case no connectivity component of dS[h, H] has the type O. 



4 



Let a and /3 be two disjoint crossing component of dS[h, H]. Let us say that 
a is on the left from f3 if the intersection of a with the critical level is on the 
left of the intersection of /3 with the critical level. (By virtue of the lemma 12.21 
both intersection are points or intervals, hence the relation "on the left from" is 
defined correctly). If a and f3 are intersecting crossing components of dS[h, H] 
then its union is an [h, _ff]-arc of the type B or D. If the type is B let us say 
that a is on the left from (3 if L(a) < L((3). If the type is D let us say that a 
is on the left of /3 if U(a) < U(/3). 

For an 5-unicritical [ft., -ff]-band and for every crossing component a of 
0S[h.H] one define its position P(a) as the number of crossing components 
of dS[h, H] which are on the left from a. 

To define the CRIPT-code of the dS[h,H] let us consider the sequence 
ai,a2, . . • A n of crossing components of dS[h,H] ordered by increasing of the 
position. 

For every crossing component on let us denote by T(Ai) the type of connec- 
tivity component of dS[h, H] to which it belongs. 

The sequence T{ai), T(a 2 ), ■ ■ ■ , T(a n ) is called is called [ft, H]-CRIPT-string 
of the domain S. 

CRIPT-string which contains the only type C is called trivial. The trivial 
CRIPT-string one obtains if the band is uncritical. 

Let us say that an increasing sequence of numbers hi, ft 2 , • ■ ■ h rn is an uni- 
critical decomposition of S, if S is contained in the band [hi, h m ] and all bands 
[hi,hi + i] are unicritical. 

For any unicritical decomposition hi, hi, ■ ■ ■ h m of S let us define [hi, ■ • • h m }- 
CRIPT code of S as the sequence of [hi, /ii + i]-CRIPT strings separated by ;. 

An unicritical decomposition hi, hi, ■ ■ • h m of S with minimal possible m is 
called minimal decomposition. The CRIPT-code of a minimal decomposition of 
5* is called minimal CRIPT-code of S. This code does not depends of the choice 
of a minimal decomposition and is uniquely defined by the domain. One can ob- 
tain the minimal CRIPT-code using any unicritical decomposition hi, hi, ■ ■ ■ h m 
simply eliminating in the [hi, hi, ■ ■ ■ ft m ]-CRIPT code trivial strings. The proof 
of these facts easily follows from the following lemma. 

Lemma 2.5 Let two bands [hi, Hi] and [/i2,i?2] be both S -unicritical and con- 
tain the same S-critical level. In this case [hi, Hi]- CRIPT-string and [h2,H2]- 
CRIPT-string coincide. 

Proof. 

Let us consider at first the case hi — hi = h and Hi < H-2- The band 
[Hi, H2] does not contain iS-critical levels, and by virtue of lemma 12. II the inter- 
section of OS with the band consists only of [Hi, iJ 2 ]-crossings. For every end 
a of a connectivity component a of dS[h, Hi] on the level Hi there is unique 
[Hi, 7?2]-crossing which contains a. Adding to a this crossing one obtains an 
connectivity component of dS[h, H2], which has the same type. And it is ob- 
vious that such operation conserve the position. Hence [h, ffi]-CRIPT-string 
coincides with [h, iJ2]-CRIPT-string. 
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If hi > h% the above arguments prove coincidence of CRIPT-strings for 
bands [hi,H 2 ] and [hi, Hi]. The coincidence of CRIPT-strings for bands [hi,H 2 ] 
and [h 2 , H 2 ] can be proved by the same arguments. The lemma is proved. 

The above lemma justifies the following definition. Let h be a critical level 
for dS, let us define /i-CRIPT-string of the domain S as [Hi, if 2 ]-CRIPT-string 
for some unicritical band [Hi,H 2 ] containing h. 

As easy to see the minimal CRIPT-code of S coincides with the sequence of 
its CRIPT-strings corresponding to all its critical levels taken in the order of 
decreasing of level. 

The CRIPT-code of a domain can be represented as a word of the alphabet 
B, C, D, ;. For example the letters A, B, O has the following CRIPT-codes [j] 
BB;CBBC;CDDC;CBBC;DDDD (letter A) 
BB;CBBC;CDDC;CBBC;CDDC;DD (letter B) 
BB;CBBC;CDDC;DD (letter O). 

One can prove the following theorem which justifies the application of min- 
imal CRIPT-code for recognition. 

Theorem 2.1 Two domains has the same minimal CRIPT-code iff there ex- 
ist such preserving orientation homeomorphism of its interiors which preserves 
height-levels, i.e. if two points have the same height its images have the same 
property. 

3 Bit-Maps Coding 

Theoretically to produce the CRIPT-code of a digital picture, which has the 
form of a bit-map it is sufficiently to indicate a way how to associate with it 
a plane region and then to consider the CRIPT-code of its boundary. This 
construction may be performed in the following way: at first one assigns to the 
bit-map the finite set of points with integral coordinates, the first coordinate 
is the number of the column of a black pixel (black corresponds to 1, and 
white corresponds to presents the background of the picture), the second is 
the number of its string. At second one assigns to each such integral point 
the rectangle which has it as its center of symmetry and has sides parallel to 
coordinates, such that its height is equal to 1 and its width is equal 1.1. Now 
the associated domain can be defined as the union of the rectangles associated 
with black pixels. The CRIPT-code of a bit-map is defined as the CRIPT-code 
of associated domain. 

In practice the generation of CRIPT-code is performed during the first scan- 
ning of the bit-map image simultaneously with dividing of the image into con- 
nectivity components, which is a necessary operation for any OCR-program. 

Let us remark that all critical levels of the associated domain has fractional 
part 1/2. Therefore all bands of the type [k, k + 1] for a natural k are unicritical 
and one can apply it to calculate the CRIPT-code of the associated domain. To 
calculate a CRIPT-string corresponding to level k+1/2 one has to consider fc-th 

lr The code of a letter is the code of some its small neighborhood 
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and (fc + l)-th strings of the bitmap. The practical algorithm produces from 
a string of a bit-map the switching sequence which represent the sequence of 
natural numbers m, n 2 , ■ ■ ■ rik which correspond to positions of switching from 
white to black and from black to white. So ri\ is the number of position of 
the first (from the left) black pixel, and rik — 1 is the number of position of 
the last black pixel. If ni,n 2 , ■ ■ .rik is the switching sequence of fc-th string 
and mi , 777,2 , • • • mi is the switching sequence of the next string we can construct 
CRIPT-code for this pair, which corresponds to the level (k + 1/2). Namely, all 
pairs of i?-codes of considered band are in one-to-one correspondence with such 
4-tuples 777^, 777i + i, Tij, 7ij + i which satisfy one of the following conditions: 

1. if i is odd then j is even and rij < rrij < 777^+1 < rij + i 

2. if i is even then j is odd and 77j <nij < ttt^+i < 77j+i 

Dually, all pairs of D-codes correspond to 4-tuples m,, mj+i, rij, rij+i which 
satisfy one of the following conditions: 

1. if j is odd then i is even and m, < rij < rij + \ < m i+ i 

2. if j is even then i is odd and 777^ < rij < rij + \ < m i+ i. 
All C codes corresponds to such pairs rij , rrii that 

1 . i + j is even and rij = mt 

2. if i is odd then j is odd mj_i < 77^ < rrii < rij + i or 77j_i < rrn < rij < 
m i+1 

3. if i is even then j is even ?77j_i < 77^ < rrii < rij + i or rij-i < m, < 77^ < 
m i+1 

4 Structure of CRIPT-code 

The CRIPT-code of a domain S represent a word of the CRIPT-alphabet 
{-B,C, -D,;}. Let us say that a word of CRIPT-alphabet is correct if there 
is such a domain which has this word as its CRIPT-code. Let us find condi- 
tions which characterize the correctness. To do it we have to introduce some 
definitions. For any word w by tz>[£;,77] we denote its subword which start with 
fc-th letter of w and terminates at the n-th letter of it. So w[k, k] is the fc-th 
letter of w. If k > 77 we pose w[k,n] is the empty word. Let us denote by B(w), 
C(w), D(w), E(w) correspondingly the number of letters B, C, D and ; in it. 
So E{c) is the number of strings in a CRIPT-code. Supposing that E{w) = — 1 
for empty word one can define the fc-th string of any word w as w[n,m], where 
E(w[l,n-2\) < E(w[l,n-l\) = E(w[l,m}) < E(w[l,m+1}) = k. The notation 
for the fc-th string of w is w[k]. 

The first obvious condition is that the first string of a correct word contains 
only letters B. One can write this condition as i?(w[l]) > 0, C(w[l]) = and 
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D(w[l}) — 0. The dual condition is that the last string contains only letters D. 
Let us name this conditions boundary conditions. 

The second pair of conditions we name evenness conditions. If before a 
symbol of the code of the type B or D there are even number of the same 
symbols then the next symbol of the code has the same type. For the letter B the 
evenness condition one can wright as follows: if B(w[l, n]) — 1 = B(w[l, n — 1]) 
is an even number then B(w[l,n + 1]) = B(w[l,n]) + 1. 

Let us say that two symbols of the same type B or D are conjugated to 
each other if before the first of them one has even number of the same sym- 
bols and the second is the succeeder of the first one. Hence in a correct code 
all symbols of types B and D are divided into pairs of conjugated symbols. 
Geometrically conjugated symbols represent pairs of crossing component of the 
same connectivity component. 

The third type of conditions we will name balance conditions. Let hk-i, 
hk and hk+i denotes such levels that the bands [hk-i,hk] and [hk,hk+i] are 
S'-unicritical and its CRIPT-strings represent fc-th and (k + l)-th strings of the 
CRIPT-code w of S. 

The number of component of intersection of dS with the level h from one 
side is equal to B(w[k]) + C(w[k)) because the last number is the number of 
ends of crossing component of dS[hk, hk+i]- And from the other side it is equal 
to D(w[k + 1]) + C(w[k + 1]) because this number is equal to the number of 
ends of crossing components of dS[hk-\, hk]. As result one obtains for every k 
the following balance equality: 

B(w[k}) + C{w[k]) = D(w[k + 1]) + C(w[k + 1]) 

Summing the balance equalities one obtains the following global equality 
B(w) — D(w) for every correct word w. 

4.1 Contactness 

Let v and w are two one-string words (E(v) = E(w) — 1)) which satisfy the 
balance equality B(v) + C(v) = D(w) + C(w). For such strings we introduce the 
following contactness relation. We will say that a letter of v on fc-th position 
contacts with a letter of w on the n-th position if the type of the first letter is B 
or C, the type of the second letter is C or D and B(v[l, k — 1]) + C(u[l,fc — 1]) = 
C(w[l,n — 1]) + D(w[l,n — 1]). The geometrical meaning of the contactness 
was demonstrated by the lemma 12.41 The contacting codes for CRIPT-strings 
of adjancent unicritical bands corresponds to intersecting crossing components. 
Now we are ready to outline the proof of the following realization theorem. 

Theorem 4.1 Every word w of the CRIF T- alphabet which satisfies the bound- 
ary conditions, the evenness conditions and balance conditions is correct. 

Proof. 

To construct the domain which has the given word w as its CRIPT-code 
let us chose an increasing sequence hi, /i2j • • • , h n , where n = E(w). On every 
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ft.j-level let us choose the sequence P%,\, Pi. 2, • • • , Pi.k(i) of points where k(i) is 
the number of letters in w[i\. Suppose that these points are ordered by the 
increasing of the first coordinate. 

Now let us join by right segments all pairs of points corresponding to conju- 
gated codes and join all pairs of contacted points. We omit the proof that the 
constructed domain has the CRIPT-code equal to w. 

From the above considerations one can get an algorithm of extracting of 
CRIPT-codes of connectivity components from the CRIPT-code of an discon- 
nected domain. Indeed, two letters of the CRIPT-code of the domain belongs 
to the same connectivity component iff there is a sequence of letters of the code 
such that the first term of the sequence coincide with the first given letter, the 
last one coincide with the second given letter and for any letter of the sequence 
its succeeder is or conjugated or contacted to it. 

4.2 Elimination of distortions 

And another practical problem is elimination of distortions. To do it one can 
clusterize the critical levels and separate clusters to generate the CRIPT-code. 
In this case one has to work with CRIPT-strings of bands which are not unicrit- 
ical. For this bands can arise arcs of type O. One can encounter a difficulties 
to order crossing components in the case because some crossing component can 
have disjoint height ranges. But all these difficulties have a practical solutions. 
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